HMM adaptation and microphone array processing for distant speech recognition
نویسندگان
چکیده
Connected strings of seven digits from the TIDIGITS database were recorded in a reverberant office room for evaluation using microphone array processing and HMM, Hidden Markov Model, adaptation. A sixteen-channel linear microphone array records a distance speech database useful for further experimentation. The adaptation techniques of Parallel Model Combination (PMC) and Maximum Likelihood Linear Regression (MLLR) are evaluated and compared. The effect of the number of adaptation utterances and number of vectors per class for the regression tree in order to optimize MLLR results are studied. Results show, compared to no adaptation, 40% word error reduction (improvement to 4.2%) for PMC and 60% word error reduction (improvement to 3.0%) for MLLR.
منابع مشابه
Speech recognition in a reverberant environment using matched filter array (MFA) processing and linguistic-tree maximum likelihood linear regression (LT-MLLR) adaptation
Performance of automatic speech recognition systems trained on close talking data su ers when used in a distant talking environment due to the mismatch in training and testing conditions Microphone array sound capture can reduce some mismatch by removing ambi ent noise and reverberation but o ers insu cient im provement in performance However using array sig nal capture in conjunction with Hidd...
متن کاملExperiments of Speech Recognition in a Noisy and Reverberant Environment Using a Microphone Array and Hmm Adaptation
REVERBERANT ENVIRONMENT USING A MICROPHONE ARRAY AND HMM ADAPTATION D. Giuliani, M. Omologo and P. Svaizer IRST Istituto per la Ricerca Scientica e Tecnologica I 38050 Povo;Trento; ITALY ABSTRACT The use of a microphone array for hands-free continuous speech recognition in noisy and reverberant environment is investigated. An array of four omnidirectional microphones is placed at 1.5 m distance...
متن کاملHands-free speech recognition using a filtered clean corpus and incremental HMM adaptation
A challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy ooce environment with incremental model adaptation functionalities. The use of a single far microphone as well as that of a microphone array input are investigated. In a previous work it was shown that the acoustic mis-match, remaining after the application of microphone array processing, can be furt...
متن کاملRobust Hmm Training and Adaptation in Hands-free Speech Recognition
In this paper a challenging scenario is addressed in which a hands-free speech recognizer operates in a noisy office environment with either batch or incremental model adaptation. The application of a microphone array processing compensates only for part of the mismatch between training and testing acoustic conditions. In a previous work it was shown that the acoustic mismatch can be further re...
متن کاملHMM-separation-based speech recognition for a distant moving speaker
This paper presents a hands-free speech recognition method based on HMM composition and separation for speech contaminated not only by additive noise but also by an acoustic transfer function. The method realizes an improved user interface such that a user is not encumbered by microphone equipment in noisy and reverberant environments. The use of HMM composition has already been proposed for co...
متن کامل